Cloud-based digital pathology

ABSTRACT

A method and systems for cloud-based digital pathology include scanning received slides that include a pathology sample to produce a sample image in a shared memory, analyzing the sample image using one or more execution nodes, each including one or more processors, according to one or more analysis types to produce intermediate results, transmitting some or all of the sample image to a client device, further analyzing the sample image responsive to a request from the client device to produce a final analysis based on the intermediate results, and transmitting the final analysis to the client device.

RELATED APPLICATION INFORMATION

This application claims priority to provisional application Ser. No. 61/514,143, filed on Aug. 2, 2011, to provisional application Ser. No. 61/514,144, filed Aug. 2, 2011, and to provisional application Ser. No. 61/514,146, filed Aug. 2, 2011, each incorporated herein by reference. This application is related to application Ser. No. TBD, Attorney Docket No. 11026 (449-237) entitled, “INTERACTIVE ANALYTICS OF DIGITAL HISTOLOGY SLIDES,” filed concurrently herewith and incorporated herein by reference. This application is related to application Ser. No. TBD, Attorney Docket No. 11027 (449-238) entitled, “DIGITAL PATHOLOGY WITH LOW-LATENCY ANALYTICS,” filed concurrently herewith and incorporated herein by reference.

BACKGROUND

1. Technical Field

The present invention relates to digital pathology and, more particularly, to image analysis performed with whole-slide imaging.

2. Description of the Related Art

Pathologists and medical doctors analyze very large digital images of whole histopathology slides using whole-slide imaging browsers. Such browsers form a kind of virtual microscope running on a computer, allowing a user to manipulate the image in a user-friendly fashion, e.g., by panning and zooming, and implement computer-based image analysis on the slide. Image analysis normally operates on a portion of the image, called the region of interest (ROI), but the size of the ROI is often constrained by the complexity of the analysis and the available computing resources. Analytics on histopathological images, including image processing, image analysis, and machine learning, is frequently computationally intensive and cannot be performed in an interactive way.

Existing systems are desktop or web-browser based and perform viewing and limited analysis. These systems do not have the capability of executing full analysis of tissues. As a result, analysis on such systems is not scalable, and demanding functions take too long for interactive execution.

Furthermore, existing distributed computing systems are inadequate to address the needs of digital pathology, because the computation and communication demands may overwhelm even powerful distributed systems. Images of histology slides can be, for example, several gigabytes in size, such that it is usually infeasible to transfer such images back and forth between client and server. Moreover, the computations involved in modern analytics can be very intensive, particularly if performed on the entire image. As such, existing cloud servers are not optimized to handle digital pathology services.

SUMMARY

A method for electronic pathology analysis includes scanning received slides that include a pathology sample to produce a sample image in a shared memory; analyzing the sample image using one or more execution nodes, each including one or more processors, according to one or more analysis types to produce intermediate results; transmitting some or all of the sample image to a client device; further analyzing the sample image responsive to a request from the client device to produce a final analysis based on the intermediate results; and transmitting the final analysis to the client device.

A system for electronic pathology analysis includes a shared memory configured to store a scanned image from a received slide; one or more execution nodes, each including one or more processors, configured to analyze the scanned image according to one or more analysis types to produce intermediate results and to further analyze the scanned image responsive to a request from a client device to produce a final analysis based on the intermediate results; and a network transceiver configured to transmit some or all of the scanned image to the client device and to transmit the final analysis to the client device responsive to the request from the client device.

These and other features and advantages will become apparent from the following detailed description of illustrative embodiments thereof, which is to be read in connection with the accompanying drawings.

BRIEF DESCRIPTION OF DRAWINGS

The disclosure will provide details in the following description of preferred embodiments with reference to the following figures wherein:

FIG. 1 is a diagram of an interface layer and an interpretation layer for interactive image analytics according to the present principles.

FIG. 2 is a block/flow diagram of a method for interactive image analysis according to the present principles.

FIG. 3 is a block/flow diagram of a method for pre-processing images according to the present principles.

FIG. 4 is a block/flow diagram of a method for interactive image analysis using pre-processed images according to the present principles.

FIG. 5 is a block/flow diagram of a method for dividing a region of interest into sub-sections optimized for parallel execution according to the present principles.

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS

The present principles provide a multi-layer system that allows for distributed processing of analytical tasks, allowing users to perform analysis in real-time with a high degree of responsiveness. Processing is separated into a user interface layer that permits a user to interactively view and direct analysis, and an interpretation layer that distributes computation-heavy analysis to a back-end server or servers that have greater computational power than the interface layer. By focusing computational tasks at a place other than the end-user's terminal, the whole-slide imaging (WSI) browser may be implemented on a much smaller device, e.g., a tablet or laptop.

Referring now in detail to the figures in which like numerals represent the same or similar elements and initially to FIG. 1, an analytic system 100 according to the present principles is shown. An interface layer 102 includes a user interface 106, a processor 108, and local memory 110. The user interface 106 may be a graphical user interface that includes a display and one or more input devices. The user interface 106 provides a user with the ability to view and manipulate digital images of, e.g., samples for pathological analysis, said images being stored in local memory 110 or streamed from a shared memory 114. The user interface 106 directs processor 108 to perform local analysis and uses the processor 108 to provide functionality such as panning and zooming in the images. For processes or analyses which are too computationally intensive to be performed in real-time at interface layer 102, requests are sent to interpretation layer 104. Communication between the interface layer 102 and the interpretation layer 104 may be performed over, e.g., the Internet, a local area network, or via a wireless connection, using an appropriate network transceiver 111 at each layer.

The interpretation layer 104 may include one or more execution nodes 112. The execution nodes 112 may represent a single computer system with a single processor, having one or more processing cores, or with multiple processors. The execution nodes 112 may also represent multiple distinct computer systems that have been networked in, e.g., a cloud arrangement. The execution nodes 112 have access to a shared memory 114 which stores the image information being used by the interface layer 102. This image information may be communicated by the interface layer 102 or may be stored in advance to minimize communication times and improve responsiveness. An execution controller 116 accepts analysis requests from the interface layer 102 and divides large analyses into multiple sub-jobs, which the execution controller 116 then distributes to execution nodes 112 for analysis. The execution controller 116 includes a scheduler configured to prioritize sub-jobs in such a way as to provide low-latency feedback to the user, optimizing for interactive use. Scheduling may include ordering sub-jobs within a single execution node 112, may include distributing the sub-jobs between execution nodes 112 for parallel execution, or may represent a combination of the two. Different types of run-time schedulers, such as Hadoop®, may be used in execution controller 116 and may be implemented transparently to the user. Upon completion of the analysis by the execution nodes 112, the execution controller 116 assembles the results of the sub-jobs into a single analysis or result and communicates that result back to interface layer 102, where it may be stored and/or displayed to the user.

To obtain good performance, as measured by the subjective delay that the user experiences with the interface layer 102 when requesting analyses, both computational loads and communication loads between the layers are optimized. In one example, a user may use a pointer in user interface 106 to select a portion of an image. The interface layer 102 communicates the selection to the interpretation layer 104, which automatically transforms the freeform region selected by the user into a set of rectangular sections in the image. The number of sections, as well as their size, is optimized to maximize processing speed on the available execution nodes 112 while preserving algorithmic constraints of the selected analytics, such as a minimal resolution needed for proper detection of particular pathological indicators or features. The results of each rectangular section are then filtered, combined, and integrated into a single report at the execution controller 116 and sent to interface layer 102, where the results are displayed as an overlay graphic on the image. The whole process of selecting optimal resolution and of splitting the analysis into sub-jobs takes place transparently, without explicit user direction.
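As a non-limiting illustration, the transformation of a freeform selection into rectangular sections may be sketched as follows. The sketch assumes that the selection has already been rasterized into a boolean mask (a list of rows) and that a fixed tile size is used; the function name `tiles_covering_mask` and the `(left, top, right, bottom)` tuple layout are hypothetical and are not taken from the disclosure.

```python
def tiles_covering_mask(mask, tile_h, tile_w):
    """Return rectangles (left, top, right, bottom) of the fixed-size tiles
    that contain at least one selected cell of the freeform mask.

    `mask` is a list of rows of booleans (an assumed representation of the
    user's freeform selection). Tiles on the right and bottom borders are
    clipped to the mask extent.
    """
    n_rows, n_cols = len(mask), len(mask[0])
    tiles = []
    for top in range(0, n_rows, tile_h):
        for left in range(0, n_cols, tile_w):
            bottom = min(top + tile_h, n_rows)
            right = min(left + tile_w, n_cols)
            # Keep the tile only if the selection touches it.
            if any(mask[r][c]
                   for r in range(top, bottom)
                   for c in range(left, right)):
                tiles.append((left, top, right, bottom))
    return tiles
```

Tiles that the selection does not touch are dropped, so no execution node spends time on image areas the user did not select.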

Referring now to FIG. 2, a method for whole slide image analysis is shown. At block 202, a user at interface layer 102 interactively selects a set of regions on an image. This may be accomplished using any appropriate input device including, e.g., a mouse pointer used to outline the desired region in a freeform fashion. Block 203 transmits the user's selection and any analysis instructions to the interpretation layer 104, where block 204 transforms the selection region(s) into rectangular sub-sections at an appropriate size and resolution. Block 204 uses execution controller 116 to optimize sub-section size to provide a seamless, interactive experience for the user.

Block 206 transfers the image sub-sections to execution nodes 112 for processing according to a schedule generated by execution controller 116. One exemplary schedule may assign sub-sections to execution nodes 112 as they become available in a round-robin fashion. The execution nodes 112 access image data from shared memory 114 and perform their assigned analyses. The results from each of the execution nodes 112 are collected by block 208 at execution controller 116 to obtain a final result, which is then transmitted to interface layer 102. The result is then stored in local memory 110 and displayed on the user interface 106 as, e.g., an overlay on the original image. This process may be repeated for as many different selections as a user chooses, and for as many types of analysis as are available.
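By way of example only, a round-robin schedule of the kind mentioned above may be sketched as follows. The sketch is a simplification: it assigns sub-sections to nodes in a fixed cyclic order up front rather than reacting to nodes as they become available, and the function name `assign_round_robin` and the node identifiers are hypothetical.

```python
from itertools import cycle


def assign_round_robin(sub_sections, node_ids):
    """Assign each sub-section to a node, cycling through the nodes in order.

    Returns a dict mapping each node identifier to its list of sub-sections.
    This is a static approximation of a round-robin schedule.
    """
    if not node_ids:
        raise ValueError("at least one execution node is required")
    schedule = {node: [] for node in node_ids}
    for section, node in zip(sub_sections, cycle(node_ids)):
        schedule[node].append(section)
    return schedule
```

For instance, five sub-sections distributed over two nodes give the first node three sub-sections and the second node two.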

Embodiments described herein may be entirely hardware, entirely software or including both hardware and software elements. In a preferred embodiment, the present invention is implemented in software, which includes but is not limited to firmware, resident software, microcode, etc.

Embodiments may include a computer program product accessible from a computer-usable or computer-readable medium providing program code for use by or in connection with a computer or any instruction execution system. A computer-usable or computer-readable medium may include any apparatus that stores, communicates, propagates, or transports the program for use by or in connection with the instruction execution system, apparatus, or device. The medium can be magnetic, optical, electronic, electromagnetic, infrared, or semiconductor system (or apparatus or device) or a propagation medium. The medium may include a computer-readable storage medium such as a semiconductor or solid state memory, magnetic tape, a removable computer diskette, a random access memory (RAM), a read-only memory (ROM), a rigid magnetic disk and an optical disk, etc.

A data processing system suitable for storing and/or executing program code may include at least one processor coupled directly or indirectly to memory elements through a system bus. The memory elements can include local memory employed during actual execution of the program code, bulk storage, and cache memories which provide temporary storage of at least some program code to reduce the number of times code is retrieved from bulk storage during execution. Input/output or I/O devices (including but not limited to keyboards, displays, pointing devices, etc.) may be coupled to the system either directly or through intervening I/O controllers.

Network adapters may also be coupled to the system to enable the data processing system to become coupled to other data processing systems or remote printers or storage devices through intervening private or public networks. Modems, cable modems, and Ethernet cards are just a few of the currently available types of network adapters.

In the same vein as above, the user experience can be further optimized by pre-processing images. For example, pre-processing may include applying the most time-consuming types of analysis to an entire slide. In situations where computational resources for pre-processing are limited, the system may prioritize those portions of an image that are more likely to be of interest to the user, determined using a trained classifier that is trained using a large set of training images which include exemplary selections of regions of interest for particular types of analysis. Alternatively, parts of an image to be pre-processed may be selected by a human technician. Each type of analysis may have its own region of interest, as each image analysis will be looking for different things within the image. The results of pre-processing may be stored in shared memory 114 or transferred to the local memory 110 with the image itself.

When a user requests analysis from user interface 106, cached intermediate results are recalled and only the final analysis, which may involve parameters chosen by the end user, is performed in real-time. Because the most computationally intensive steps have been performed in pre-processing, the final computation is usually much less complex and completes with low latency. The user interface 106 may indicate regions of the image that are pre-processed by, e.g., overlaying a semi-transparent colored grid on the image. In this way, the user may place a request knowing whether the operation will complete quickly (if the selection is in a pre-processed region) or will take additional time to compute (if some parts of the selection fall outside the pre-processed region). It should be recognized that the “grid” need not be a square or rectangular grid. The grid may instead have corners that do not align, sections that are of varying size and shape, and sections which overlap.
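As one possible illustration, whether a selection falls within pre-processed regions may be estimated by computing the fraction of the selection covered by those regions. The sketch below assumes that the selection and the regions are axis-aligned rectangles given as `(x0, y0, x1, y1)` tuples, a simplification of the irregular grid described above; overlapping regions are counted only once. The function name `covered_fraction` is hypothetical.

```python
def covered_fraction(selection, regions):
    """Fraction of `selection` covered by the union of `regions`.

    All rectangles are (x0, y0, x1, y1) with x0 < x1 and y0 < y1. Regions
    may overlap each other; overlapping area is counted once.
    """
    sx0, sy0, sx1, sy1 = selection

    def clip(value, lo, hi):
        return min(max(value, lo), hi)

    # Region edges, clipped to the selection, split it into elementary cells.
    xs = sorted({sx0, sx1, *(clip(x, sx0, sx1) for r in regions for x in (r[0], r[2]))})
    ys = sorted({sy0, sy1, *(clip(y, sy0, sy1) for r in regions for y in (r[1], r[3]))})

    covered = 0
    for xa, xb in zip(xs, xs[1:]):
        for ya, yb in zip(ys, ys[1:]):
            cx, cy = (xa + xb) / 2, (ya + yb) / 2
            # Each cell lies entirely inside or outside every region, so
            # testing its center point is sufficient.
            if any(r[0] <= cx < r[2] and r[1] <= cy < r[3] for r in regions):
                covered += (xb - xa) * (yb - ya)
    return covered / ((sx1 - sx0) * (sy1 - sy0))
```

A result of 1.0 would indicate that the request can be served from cached intermediate results alone, while a lower value indicates that some parts of the selection still need to be computed.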

Referring now to FIG. 3, a method for pre-processing an image is shown. A slide is taken and imaged in block 302. In one exemplary embodiment, hospitals and histology labs may physically ship prepared glass slides to the service provider, which scans the slides into files and then returns the slides to the customer. Although it is specifically contemplated that such slides will include, e.g., a tissue sample or other type of slide used in the diagnosis of disease, it should be recognized that the present principles apply to image manipulation and analysis for any purpose. Block 304 stores the image data in a central storage or shared memory 114. This storage 114 is accessible to execution nodes 112 and may be replicated or split between the execution nodes 112 according to determinations made by execution controller 116.

Execution controller 116 determines potential regions of interest at block 306. This may include segmenting the image into discrete sections and may further include performing an initial analysis on the image to locate portions of the image that may be of particular interest when performing one or more of the available types of analysis. For example, the execution controller 116 may predict a region of interest by noting changes of color, texture, or brightness within the image that would signify changes of tissue type. In one embodiment, potential regions of interest may be selected offline by a technician.

Having determined potential regions of interest, the execution controller 116 may pass information regarding said regions on to execution nodes 112. The execution nodes 112 apply one or more forms of analysis to the potential region(s) of interest at block 308. Because computational resources may still be limited, priority can be assigned by the execution controller 116 to regions of interest that are particularly noteworthy (judged by, e.g., a likelihood score) and to forms of analysis that are most commonly used. Once the execution nodes 112 produce a result, the result is collected by execution controller 116 and stored in shared memory 114 as an intermediate result. In one example, tumor edges may be located automatically or by a technician by determining an area in an image that shows a highest density of dye. This edge region may subsequently be used for analyses such as performing a mitotic count. In this example, regions close to the edge of the tumor would have the highest priority. If there is additional time, regions of interest within the tumor could be further processed using available resources.
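As a non-limiting sketch, the prioritization described above may be expressed as an ordering of (region, analysis) pairs. The scoring rule used here, the product of a region's likelihood score and an analysis type's relative usage frequency, is an assumption made for illustration; the disclosure does not prescribe a particular formula. The function name `order_preprocessing_jobs` and the example identifiers are hypothetical.

```python
def order_preprocessing_jobs(regions, analysis_usage):
    """Order pre-processing jobs from highest to lowest priority.

    regions:        list of (region_id, likelihood_score) pairs
    analysis_usage: dict mapping analysis name to relative usage frequency

    Priority is likelihood * usage (an assumed rule). Ties are broken by
    region_id and then analysis name so the order is deterministic.
    """
    jobs = [
        (likelihood * usage, region_id, analysis)
        for region_id, likelihood in regions
        for analysis, usage in analysis_usage.items()
    ]
    jobs.sort(key=lambda job: (-job[0], job[1], job[2]))
    return [(region_id, analysis) for _, region_id, analysis in jobs]
```

With a tumor-edge region scored at 0.9 and an interior region scored at 0.4, a frequently used analysis on the edge region is scheduled first, and the rarely used analysis on the interior region is scheduled last.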

Referring now to FIG. 4, a method for displaying pre-processed images and completing analysis is shown. Block 402 transfers the pre-processed slide image to the interface layer 102, to be stored in local memory 110. The slide image may be transferred on demand from shared memory 114 or may be distributed on physical media, e.g., a DVD or a flash storage medium. If transferred on demand, to reduce bandwidth burdens, only the portion of the image currently being viewed is transferred, and the interface layer 102 may cache such information locally in local memory 110 to improve performance. The transfer in block 402 may also be performed predictively, where image information is transferred in the background, without user requests, to anticipate potential panning and zooming by the user. This image may represent the original slide image or may have received some formatting during pre-processing. Block 404 transfers pre-processing information from shared memory 114 to the interface layer 102. This information may include, e.g., overlay information for the image to indicate pre-processed regions and types of analysis that can be performed more quickly. Block 405 displays the slide image and optionally also displays pre-processed slide information on the user interface 106. Block 406 then receives a user's request for analysis and transfers that request to the interpretation layer 104.
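For purposes of illustration only, on-demand transfer with local caching may be sketched as follows. The sketch assumes that the slide image is divided into square tiles addressed by `(column, row)` indices and that a caller-supplied function retrieves a tile from the shared memory; the names `tiles_for_viewport`, `TileCache`, and `fetch_tile` are hypothetical.

```python
def tiles_for_viewport(x, y, width, height, tile_size):
    """Return (col, row) indices of the tiles overlapping the viewport.

    The viewport is given in image pixels by its top-left corner (x, y)
    and its positive width and height.
    """
    first_col, first_row = x // tile_size, y // tile_size
    last_col = (x + width - 1) // tile_size
    last_row = (y + height - 1) // tile_size
    return [(col, row)
            for row in range(first_row, last_row + 1)
            for col in range(first_col, last_col + 1)]


class TileCache:
    """Local cache that requests a tile from the server only once."""

    def __init__(self, fetch_tile):
        self._fetch_tile = fetch_tile  # assumed callable: (col, row) -> tile data
        self._tiles = {}

    def get(self, key):
        if key not in self._tiles:
            self._tiles[key] = self._fetch_tile(key)
        return self._tiles[key]
```

Predictive transfer could reuse the same cache by calling `get` in the background for tiles adjacent to the current viewport before the user pans to them.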

Having received the user request, block 408 completes the analysis. Toward this end, execution controller 116 accesses intermediate results stored in shared memory 114 and determines what further analysis is needed to meet the user's request. The remaining work is assigned to execution nodes 112 to produce a final analysis. Block 410 transfers the final analysis result from shared memory 114 to the interface layer 102, where block 412 displays the results using the user interface 106. The analysis results may be sent all at once, or may be provided to the interface layer 102 in a progressive fashion, with some results being provided immediately to provide a higher degree of responsiveness to the user.
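As a hedged illustration of progressive delivery, results may be yielded one at a time as sub-jobs finish rather than after all of them have completed. In the sketch below a local thread pool stands in for the execution nodes 112, which is an assumption made only so that the example is self-contained; the function name `stream_results` and its parameters are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed


def stream_results(sub_jobs, analyze, max_workers=4):
    """Yield (sub_job, result) pairs as soon as each sub-job completes.

    `analyze` is an assumed callable that performs the analysis for one
    sub-job. Results arrive in completion order, not submission order.
    """
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = {pool.submit(analyze, job): job for job in sub_jobs}
        for future in as_completed(futures):
            yield futures[future], future.result()
```

A client consuming this generator could update its overlay after each pair arrives, so that early results are visible while the remaining sub-jobs are still running.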

It should also be noted that state information on the analyses may be preserved in broader contexts. For example, after performing an analysis, the user may request an additional area to be analyzed, or may merely increase or reduce the size of the current area. The state information of any previous analysis may be stored as intermediate results in shared memory 114 and used as a basis for subsequent analyses. Without maintaining a state on both the interface layer 102 and interpretation layer 104, the system would have to recompute initial analyses together with the additional requests. Instead, the present principles maintain the state of analysis and quickly compute only the missing parts, merging them with the current results.
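One way such state reuse might look, offered only as a sketch, is a per-tile result store that is consulted before any new computation. The dictionary `state`, the tile keys, and the function name `analyze_incrementally` are hypothetical and stand in for the intermediate results held in shared memory 114.

```python
def analyze_incrementally(requested_tiles, state, analyze_tile):
    """Analyze only tiles that have no stored result, then merge.

    requested_tiles: iterable of hashable tile keys for the current request
    state:           dict of previously computed results, updated in place
    analyze_tile:    assumed callable computing the result for one tile

    Returns (results for the requested tiles, list of newly computed tiles).
    """
    requested = list(requested_tiles)
    missing = [tile for tile in requested if tile not in state]
    for tile in missing:
        state[tile] = analyze_tile(tile)
    return {tile: state[tile] for tile in requested}, missing
```

When the user enlarges a selection, only the newly added tiles appear in the second return value; shrinking a selection requires no new computation at all.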

Implementing the present principles greatly increases the speed and responsiveness of WSI browser applications. Not only does the WSI browser gain the speed advantages of parallelism, but pre-processing and communication optimization allow the client to receive analytical information in real-time, providing a comfortable level of responsiveness to the user. Furthermore, by offloading analysis to a dedicated offsite service, the up-front cost of analysis is reduced, allowing smaller and less expensive client terminals to be used.

Referring now to FIG. 5, a more detailed explanation of the optimization of block 204 is shown in dividing the image into sub-sections for distributed analysis. Block 502 retrieves the image from shared memory 114 and provides it to execution controller 116. Block 504 determines a minimum sub-section height and width based on, e.g., the particular needs of the types of analysis to be applied. For example, a particular type of analysis may have a minimum feature size or may need a minimum amount of information to reach useful conclusions. Block 506 determines a maximum width and height for the sub-sections. This determination may be made by taking into consideration the system resources available. For example, each execution node 112 may have a local memory of finite size, such that it would not be able to easily handle sub-sections larger than that finite size. Another consideration when determining maximum sub-section size may include determining bandwidth limitations, such that distributing sub-sections beyond a certain size would result in unacceptable delays to the user.

Block 508 determines an optimal number of sub-sections based on, e.g., the number of available execution nodes 112. This number may be, for example, an integer multiple of the number of nodes 112, such that computational resources are used to their fullest. Block 510 performs an initial tiling of the image to produce a rectangular grid that includes, e.g., a user's freeform selection. Block 512 iteratively resizes the grid elements to conform to the freeform region while meeting the above-determined constraints. Resizing grid elements may include removing elements entirely as well as lengthening/widening or shrinking particular grid elements to more closely approximate the selected freeform shape. There is no constraint on the proportion of element width to element height. Having divided the selected region of interest into sub-sections, the execution controller 116 assigns one or more sub-sections to each execution node 112 for analysis.
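By way of a simplified example, the sizing decisions of blocks 504 through 510 may be sketched as a search over grid dimensions. The sketch assumes a single minimum and a single maximum side length applied to both width and height, and it prefers the fewest sub-sections among the grids that leave the fewest node slots idle; both choices are assumptions for illustration, and the iterative resizing of block 512 is not shown. The function name `plan_grid` is hypothetical.

```python
import math


def plan_grid(width, height, min_side, max_side, n_nodes):
    """Choose (cols, rows) for an initial tiling of a width x height area.

    Each tile side (width / cols, height / rows) must lie within
    [min_side, max_side]. Among feasible grids, prefer tile counts that
    are a multiple of n_nodes, then the smallest tile count.
    """
    min_cols, max_cols = math.ceil(width / max_side), width // min_side
    min_rows, max_rows = math.ceil(height / max_side), height // min_side
    best_key, best_grid = None, None
    for cols in range(min_cols, max_cols + 1):
        for rows in range(min_rows, max_rows + 1):
            count = cols * rows
            idle_slots = (-count) % n_nodes  # 0 when count is a multiple of n_nodes
            key = (idle_slots, count)
            if best_key is None or key < best_key:
                best_key, best_grid = key, (cols, rows)
    if best_grid is None:
        raise ValueError("no grid satisfies the size constraints")
    return best_grid
```

For a 4000 by 3000 pixel bounding box with sides limited to between 500 and 2000 pixels, four nodes yield a 2 by 2 grid, whereas three nodes yield a 2 by 3 grid so that each node receives two sub-sections.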

Having described preferred embodiments of a system and method for cloud-based digital pathology (which are intended to be illustrative and not limiting), it is noted that modifications and variations can be made by persons skilled in the art in light of the above teachings. It is therefore to be understood that changes may be made in the particular embodiments disclosed which are within the scope of the invention as outlined by the appended claims. Having thus described aspects of the invention, with the details and particularity required by the patent laws, what is claimed and desired protected by Letters Patent is set forth in the appended claims.

1. A method for electronic pathology analysis, comprising: scanning received slides that include a pathology sample to produce a sample image in a shared memory; analyzing the sample image using one or more execution nodes, each including one or more processors, according to one or more analysis types to produce intermediate results; transmitting some or all of the sample image to a client device; further analyzing the sample image responsive to a request from the client device to produce a final analysis based on the intermediate results; and transmitting the final analysis to the client device.
2. The method of claim 1, further comprising determining potential regions of interest within an image for pre-processing.
3. The method of claim 2, wherein potential regions of interest are determined using a trained classifier that is trained with training images and selections from said training images associated with particular forms of analysis.
4. The method of claim 2, wherein analyzing the sample image is limited to areas within a potential region of interest.
5. The method of claim 1, further comprising transmitting information regarding the intermediate results to the client device.
6. The method of claim 5, wherein said information regarding the intermediate results includes determined potential regions of interest which underwent analysis for display at the interface layer in conjunction with the image.
7. The method of claim 1, further comprising: receiving slides that include a pathology sample from a source; and returning said slides to the source after scanning.
8. The method of claim 1, wherein the step of analyzing further comprises dividing the sample image into sub-sections optimized for parallel execution.
9. The method of claim 1, wherein transmitting some or all of the sample image to a client device comprises streaming portions of the sample image to the client device responsive to a selected area of viewing on the client device.
10. The method of claim 1, wherein the request from the client device comprises a request for analytics on a specified portion of the sample image.
11. A non-transitory computer readable storage medium comprising a computer readable program, wherein the computer readable program when executed on a computer causes the computer to perform steps of: scanning received slides that include a pathology sample to produce a sample image in a shared memory; analyzing the sample image using one or more execution nodes, each including one or more processors, according to one or more analysis types to produce intermediate results; transmitting some or all of the sample image to a client device; further analyzing the sample image responsive to a request from the client device to produce a final analysis based on the intermediate results; and transmitting the final analysis to the client device.
12. A system for electronic pathology analysis, comprising: a shared memory configured to store a scanned image from a received slide; one or more execution nodes, each including one or more processors, configured to analyze the scanned image according to one or more analysis types to produce intermediate results and to further analyze the scanned image responsive to a request from a client device to produce a final analysis based on the intermediate results; and a network transceiver configured to transmit some or all of the scanned image to the client device and to transmit the final analysis to the client device responsive to the request from the client device.
13. The system of claim 12, further comprising an execution controller configured to determine potential regions of interest within an image for pre-processing.
14. The system of claim 13, wherein the execution controller is further configured to determine potential regions of interest using a trained classifier that is trained with training images and selections from said training images associated with particular forms of analysis.
15. The system of claim 13, wherein execution nodes are configured to analyze only parts of the image that fall within a potential region of interest.
16. The system of claim 12, wherein the network transceiver is further configured to transmit information regarding the intermediate results to the client device.
17. The system of claim 16, wherein said information regarding the intermediate results includes determined potential regions of interest which underwent analysis for display at the interface layer in conjunction with the image.
18. The system of claim 12, further comprising an execution controller configured to divide the sample image into sub-sections optimized for parallel execution during analysis.
19. The system of claim 12, wherein the network transceiver is further configured to stream portions of the sample image to the client device responsive to a selected area of viewing on the client device.
20. The system of claim 12, wherein the request from the client device comprises a request for analytics on a specified portion of the sample image.